Adaptive Sensing in Uncertain Environments: Maximum Likelihood, Sensor Networks, and Reinforcement Learning
نویسندگان
چکیده
The advent of distributed and agile sensing systems that collect data in multiple locations and through a variety of sensing modalities has brought about new and exciting challenges to the field of signal processing. Motivated by problems that arise in the development of these systems, the thesis makes contributions in three domains: (1) distributed optimization for inference in sensor networks, (2) statistical tests for optimality that mitigate the problem of sensitivity to local maxima, and (3) development and analysis of reinforcement learning solutions to stochastic decision problems for resource allocation in agile sensing. A novel incremental gradient method, called incremental aggregated gradient (IAG), that can be used by wireless sensor networks to perform inference in a distributed manner, is proposed and analyzed. A gradient aggregation concept relaxes the common requirement of incremental methods for a diminishing step size for convergence, and a fast convergence rate is established. The convergence of IAG is established under a certain unimodality assumption. For non-convex problems however, for example when IAG is applied to find the maximum likelihood estimator, the method might stagnate at a local maximum. To mitigate this weakness, the following question is addressed: Given the location of a relative maximum of the log-likelihood function, how to assess whether it is the global maximum? We analyze and improve an existing statistical tool, called A Test for Global Maximum, that answers this question by posing it as a hypothesis
منابع مشابه
Multicast Routing in Wireless Sensor Networks: A Distributed Reinforcement Learning Approach
Wireless Sensor Networks (WSNs) are consist of independent distributed sensors with storing, processing, sensing and communication capabilities to monitor physical or environmental conditions. There are number of challenges in WSNs because of limitation of battery power, communications, computation and storage space. In the recent years, computational intelligence approaches such as evolutionar...
متن کاملDynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)
In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...
متن کاملReinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملOn the effect of low-quality node observation on learning over incremental adaptive networks
In this paper, we study the impact of low-quality node on the performance of incremental least mean square (ILMS) adaptive networks. Adaptive networks involve many nodes with adaptation and learning capabilities. Low-quality mode in the performance of a node in a practical sensor network is modeled by the observation of pure noise (its observation noise) that leads to an unreliable measurement....
متن کاملReinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کامل